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PATENT 

XYLOGLUCAN FUCOSYLTRANSFERASES 

FIELD OF THE INVENTION 

This invention is in the field of molecular biology. In particular, this 
invention relates to the isolation, purification, cloning and expression of plant 
xyloglucan fucosyltransferases. 

This application claims priority from US provisional serial no. 60/117,555 
filed January 18, 1999 and US application serial no. 09/490,521 filed January 25, 
2000. 

BACKGROUND OF THE INVENTION 

In most multicellular organisms, cells are embedded in an intricate 
extracellular matrix that keeps them together and influences the shape, 
development, and polarity of the cells they contact. Animal cells have such an 
extracellular matrix at their surface, but plants possess a distinct wall that 
encloses every cell. Many important differences between plants and animals with 
respect to nutrition, digestion, growth, reproduction, and defense mechanisms can 
be traced to the plant cell wall. Cell walls are mediators of growth, which in plants 
is determined largely by the wall extensibility provided that sufficient turgor 
pressure is present. Morphogenesis is also effected by the cell wall at the tissue 
and cellular levels. The biosynthesis of plant cell walls must be very tightly 
regulated. Although an individual plant cell may expand its volume by as much as 
18,840 times, its cell wall must maintain a regular thickness and uniform structure 
to prevent hemorrhaging of the cell contents due to the high internal turgor 
pressure. However, despite extensive descriptions of the chemical and physical 
structure of the plant cell wall, very little is known about its biosynthesis. Only one 
cell wall-synthesizing glycosyltransferase, cellulose synthase, has been cloned 
and described in any detail 

Plant cell walls are mainly composed of cellulose microfibrils and matrix 
polysaccharides. Hemicellulose is a type of matrix polysaccharide that binds 
tightly but noncovalently to cellulose microfibrils, helping to crosslink them into a 
complex network. Xyloglucan is a fundamentally important hemicellulose in dicot 
and nongraminaceous monocot plants. It comprises approximately 25% of the 
total cell wall and forms a load-bearing network by associating with the faces of 
surrounding cellulose microfibrils via hydrogen bonds. Xyloglucan contains a 
beta-1 ,4-glucan backbone decorated with side chains of xylose alone, xylose and 
galactose, and xylose, galactose and fucose. The presence or absence of the 
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fucose residue is thought to determine whether the xyloglucan conformation is 
planar and thus better able to bind to cellulose, a critical step in cell wall formation. 
In addition, oligosaccharides consisting of a monomer of xyloglucan have been 
shown to prevent auxin-promoted elongation of pea stems when the 
5 oligosaccharides contain fucose, but not if they lack fucose suggesting that 

xyloglucan fragments act as signalling molecules in vivo. Xyloglucan fucosylation 
is thus a critical step in plant development. 

There is thus a need to identify the genes and gene products involved in 
plant xyloglucan fucosylation. In particular, there is a need to isolate, purify and 
10 clone xyloglucan fucosyltransferase genes and gene products so that xyloglucan 
fucosylation may be controlled and regulated in plants and other organisms. 




SUMMARY OF THE INVENTION 

In order to meet these needs, the present invention is directed to purified, 
1 5 isolated, sequenced and cloned plant xyloglucan fucosyltransferase. In addition, 
the present invention is directed to the purification, isolation, sequencing and 
cloning of plant xyloglucan fucosyltransferase. The present invention is further 
directed to transgenic organisms expressing plant xyloglucan fucosyltransferase. 
The present invention is further directed to transgenic plants expressing regulated 
20 levels of xyloglucan fucosyltransferases. 

^)\J> In genekal, the invention features substantially pure fucosyltransferase 
DNA or protein obtained from a plant. In a related aspect, the invention features a 
fragment or analog polypeptide including an amino acid sequence substantially 
identical to the sequences shown in Tables 2, 5 and 6. 
25 In another Related aspect, the invention features substantially pure DNA 

j^JD^iving a sequence substantially identical to the nucleotide sequence shown in 
0 ^ Tables 3-14. In preferred embodiments, such DNA is cDNA or is genomic DNA. 
In related aspects, |he invention also features a vector and a cell (e.g., a plant) 
which includes sucft substantially pure DNA. In various preferred embodiments, 
30 the vector-containirra cell is a prokaryotic cell, for example, E. coli or 
Agrobacterium or, more preferably, a plant cell. 

3j? In yet andjther related aspect, the invention features a method of 
fucosylating a powpeptide in vivo involving: (a) providing a cell containing the 
fucosyl transferase DNA of the invention positioned for expression in the cell; and 
35 (b) culturing the transformed cell under conditions for expressing the DNA, 
resulting in the fucosylation of the protein. In preferred embodiments, 
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fucosylation occurslin a plant cell. 
£^J^> In anoth€ r aspect, the invention features a recombinant polypeptide 
fucosylated usin } a cell expressing DNA which is substantially identical to the 
nucleotide sequ< nee shown in Tables 3-14. In still other preferred embodiments, 
the polypeptide is further fucosylated using one or more fucosyl transferases. 
The present invention further includes multiple types of DNA constructs 
eluding (1) "sense" constructs encoding proteins, which can increase the 
expression of fucoysltransferases in plant species and (2) "antisense" constructs 
containing DNA, which can be used to produce antisense RNA in to reduce 
expression of fui :osyltransfereases in plants. Optimal amounts of antisense RNA 
in transgenic plaits will selectively inhibit the expression of genes in these plants 
which are involved in the fucoylsation of xyloglucans. 

Some of these constructs will direct constitutive production of transcripts. 
Other constructs will direct expression in specific organs and/or specific tissue 
layers of the transgenic plant. These organs will include leaves, petioles, stems, 
flower organs, seeds, fruits or photosynthetically active parts of the plant. Tissue 
layers will include but may not be restricted to the epidermis and adjacent cell 
layers. 

The present invention also provides recombinant cells and plants 
cpntaining these constructs. 

In one Embodiment, the first category of DNA constructs include: a 
promoter selected from but not limited to constitutive, tissue-specific, cell-type 
specific, seed-specific, flower-specific, fruit-specific, epidermis-specific promoters, 
a promoter specific to cell layers adjacent to the epidermis or a promoter specific 
to photosyntheticplly active plant tissues, which functions in plant cells to cause 
an RNA sequence. In this embodiment, the DNA coding region 
code proteins which can be used to increase the activity of 



20 contai 



the production of 
sequences that 
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plant fucosyl transferases in transgenic plants. The DNA coding region will further 
include a region 3' to the coding regions the 3' non-translated region which 
functions in plant c ells to cause the addition of polyadenylate nucleotides to the 3' 
end of the RNA sequence promoter. 

In another embodiment, a second category of DNA construct will include a 
constitutive promot sr, seed-specific, flower-specific, fruit-specific, epidermis- 
promoter specific to cell layers adjacent to the epidermis or a 
photosynthetically active plant tissues, which functions in 
he production of an RNA sequence. The DNA construct will 




specific promoter, a 
promoter specific tc 
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also include DNA] sequences which can produce antisense RNA molecules. 
These RNA molecules can selectively inhibit the accumulation of transcripts 
encoding proteins which encode plant fucosyl transferases. 

In accordance with another aspect of the present invention, there is 
provided a method of producing genetically transformed plants which express a 
gene or genes involved in fucosyl transferase activity. In this method, a 
recombinant, double-stranded DNA molecule into the genome of a plant cell. In 
this embodiment, the DNA sequence will include a promoter which functions in 
plant cells to causelthe production of an RNA sequence in flowers, seeds, fruit or 
other plant tissues. |n addition, the sequence will include a DNA coding sequence 
encoding proteins involved in fucosyl transferase activity in plants. Alternatively, 
the sequence will baa template to the synthesis of antisense RNA inhibiting the 
development of these structures. The DNA sequence will also include a 3' non- 
translated region wiich functions in plant cells to cause the addition of 
polyadenylate nucleotides to the 3' end of the RNA sequences. The method also 
includes obtaining transformed plant cells and regenerating from the transformed 
plant cells geneticallyjtransformed plants. The transformed plant cells may be 
used to overproduce in cell culture the fucosylated xyloglucans. 

The present invention is also directed to transgenic cells such as yeast, 
fungi, mammalian, and the like cells expressing the DNA sequences of this 
invention. The present invention is also directed to purified fucosylated 
xyloglucans isolated from the transgenic cells of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



This invention will be 



which: 



all fractions from an anion exch* 
xyloglucan-specific fucosyltrans 
incorporated into product per se 
column eluate. 



better understood by reference to the figures, in 



Figure 1 shows the bioc lemical purification of xyloglucan-specific 
fucosyltransferase from pea. Figure 1A shows a silver-stained SDS-PAGE gels 
showing protein profiles from ca 'bonate-washed pea epicotyl extract, peak 
fractions from a GDP-agarose and size exclusion column, and the entire profile of 



nge column. Figure 1 B shows a 

erase activity, in nKat (= nMoles of substrate 

x>nd), for each fraction of the anion exchange 
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Figure 2 shows con 



activity. Figure 2A shows that Anti-AtFT1 polyclonal antibodies recognize an 



approximately 62 kDa polyp 



Lane 3, pea (40 :l.) Left, We 



irmation of Arabidopis fucosyltransferase (Ftase) 



ptide in solubilized membrane proteins of 



Arabidopsis, but not pea. Lane 1 , antigen (50 ng); Lane 2, Arabidopsis (40 :l), 



tern blot. Right, Commassie staining of membrane 



showing protein profiles. Fk ure 2B shows that anti-FTase antibodies 
immunoprecipitate more XG specific FTase activity than an equal volume of 
preimmune serum. Shown is FTase activity, in nanokats, of precipitated pellets 
that were subsequently subje cted to FTase activity assays. This is an example 
similar to results seen in sevon different replicates. Figure 2C shows that a 
full-length AtFTase which wa 5 expressed in a COS cell line shows XG-specific 
FTase activity. Activity in nKfat is shown in the presence or absence of tamarind 



XG for untransformed Cos-7 
cells transformed with vecto 
Golgi vesicles. 



cells, cells transformed with vector DNA (pCDNA3), 
containing AtFT1 (pCDNA-XGFT), or solubilized pea 



Figure 3 shows a hy drophobicity plot of AtFT1 . 



Figure 4 shows a diagram of plasmid pMEN020. 



. Ft 



DEFINITIONS 

To ensure a complete understanding of the invention, the following 
definitions are provided: 

25 Xyloglucan: . Xyloglucan is a hemicellulose carbohydrate present in dicot 

and nongraminaceous monocot plants comprising approximately 25% of the total 
cell wall and forming a load-bearing network by associating with the faces of 
surrounding cellulose microfibrils via hydrogen bonds. Xyloglucan contains a 
beta-1 ,4-glucan backbone decorated with side chains of xylose alone, xylose and 

30 galactose, and xylose, galactose and fucose. 



0^ 

O Ftase 



Xyloglucan fyetfsyltransferase: Xyloglucan fucosyltransferase (XG 
Ftase) is an en#fne that fucosylates xyloglucan by adding a fucose residue to 
xyloglucar 
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Transgenic Plants: Transgenic plants are plants which contain DNA 
sequences which were introduced by transformation. 

Promoter: A promoter is the minimal DNA sequence sufficient to direct 
transcription. Promoters can render transcription controllable for cell-type specific, 
tissue-specific, or inducible expression. Promoter elements may be located in the 
5' or 3' regions of the native gene. 

Poly-A Addition Site: A poly-A addition site is a nucleotide sequence 
which causes certain enzymes to cleave mRNA at a specific site and to add a 
sequence of adenylic acid residues to the 3'-end of the mRNA. 

Polypeptide: Polypeptide means any chain of amino acids, regardless of 
length or post-translational modification (e.g., glycosylation or phosphorylation). 

Substantially Identical: For a polypeptide, substantially identical means a 
polypeptide exhibiting at least 50%, preferably 70%, more preferably 90%, and 
most preferably 95% identity to a reference polypeptide. For a nucleic acid 
substantially identical means a nucleic acid sequence exhibiting at least 85%, 
preferably 90%, more preferably 95%, and most preferably 97% identity to a 
reference nucleic acid sequence. For polypeptides, the length of comparison 
sequences will generally be at least 16 amino acids, preferably at least 20 amino 
acids, more preferably at least 25 amino acids, and most preferably 35 amino 
acids. For nucleic acids, the length of comparison sequences will generally be at 
least 30 nucleotides, preferably at least 60 nucleotides, more preferably at least 
75 nucleotides, and most preferably 110 nucleotides. 

Sequence identity is typically measured using sequence analysis software 
(e.g., Sequence Analysis Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
Wis. 53705). Such software matches similar sequences by assigning degrees of 
homology to various substitutions, deletions, substitutions, and other 
modifications. Conservative substitutions typically include substitutions within the 
following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, 
glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and 
phenylalanine, tyrosine. 
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Substantially Pur Polypeptid : Substantially pure polypeptide means a 
fucosyltransferase polypeptide which has been separated from components which 
naturally accompany it. Typically, the polypeptide is substantially pure when it is 
at least 60%, by weight, free from the proteins and naturally-occurring organic 
molecules with which it is naturally associated. Preferably, the preparation is at 
least 75%, more preferably at least 90%, and most preferably at least 99%, by 
weight fucosyltransferase polypeptide. A substantially pure fucosyltransferase 
polypeptide may be obtained, for example, by extraction from a natural source 
(e.g., a plant) by expression of a recombinant nucleic acid encoding a 
fucosyltransferase polypeptide; or by chemically synthesizing the protein. Purity 
can be measured by any appropriate method, e.g., column chromatography, 
polyacrylamide gel electrophoresis, or HPLC analysis. 

A protein is substantially free of naturally associated components when it is 
separated from those contaminants which accompany it in its natural state. Thus, 
a protein which is chemically synthesized or produced in a cellular system 
different from the cell from which it naturally originates will be substantially free 
from its naturally associated components. Accordingly, substantially pure 
polypeptides include, without limitation, those derived from eukaryotic organisms 
but synthesized in E. coli or other prokaryotes, or those derived from a eukaryotic 
cell which does not normally synthesize such a protein, or those derived from a 
eukaryotic cell engineered to overexpress such a protein. 

Substantially Pure DNA : Substantially pure DNA means DNA that is free 
of the genes which, in the naturally-occurring genome of the organism from which 
the DNA of the invention is derived, flank the gene. The term therefore includes, 
for example, a recombinant DNA which is incorporated into a vector; into an 
autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or 

a genomic or cDNA fragment produced by PCR or restriction endonuclease 

i 

digestion) independent of other sequences. It also includes a recombinant DNA 
which is part of a hybrid gene encoding additional polypeptide sequence. 

Transformed Cell: Transformed cell means a cell into which (or into an 
ancestor of which) has been introduced, by means of recombinant DNA 
techniques, a DNA molecule encoding (as used herein) a fucosyltransferase 
polypeptide. 
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Positioned for Expr ssion: Positioned for expression means that the 
DNA molecule is positioned adjacent to a DNA sequence which directs 
transcription and translation of the sequence (i.e., facilitates the production of, 
e.g., a recombinant fucosyltransferase polypeptide or RNA molecule). 



Operably Linked: Operably linked mean that a gene and a regulatory 
sequence(s) are connected in such a way as to permit gene expression when the 
appropriate molecules (e.g., transcriptional activator proteins) are bound to the 
regulatory sequence(s). 



Purified Antibody: Purified antibody means an antibody which is at least 
60%, by weight, free from proteins and naturally-occurring organic molecules with 
which it is naturally associated. Preferably, the preparation is at least 75%, more 
preferably 90%, and most preferably at least 99%, by weight, antibody, e.g., a 
plant fucosyltransferase specific antibody. A purified fucosyltransferase antibody 
may be obtained, for example, by affinity chromatography using recombinantiy- 
produced fucosyltransferase protein or conserved motif peptides and standard 
techniques. 

Specifically Binds: Specifically binds means an antibody which 
recognizes and binds fucosyltransferase protein but which does not substantially 
recognize and bind other molecules in a biological sample. 

DETAILED DESCRIPTION OF THE INVENTION 

Taking into account these definitions, the present invention is directed to 
isolated, purified and cloned plant xyloglucan fucosyltransf erases. 



Plant Fucosyltransferase Purification 

A biochemical approach was utilized to purify sufficient quantities of 
xyloglucan fucosyltr insferases from pea epicotyls. Pea microsomes were 
prepared as follows: 2 cm segments, excised just below the apical hook, of 
etiolated Pisum sativum, cv Alaska were collected and homogenized in 1.5 
volumes buffer (50 nM Hepes pH 7.5, 1 mM EDTA pH 8.0, 0.4 M sucrose, 1 mM 
DTT, 0.1 mM PMSF 1 :g/mL each aprotinin, leupeptin, and pepstatin.) The 
homogenate was filt 3red, centrifuged at 2,000 x g for 15 minutes, and the 
supernatent was centrifuged at 100,000 x g for 1 hour. The resulting pellets were 
washed and homog mized in the presence of 0.1 M Na 2 C0 3 to strip away 
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15 



peripheral membrane proteins (Y.IFuJiki, A. L. Hubbard, S. Fowler, P. B. Lazarow, 
J. Cell Biol. 93, 97 (1982).) The s jspension was centrifuged at 100,000 x g for 1 
hour and the resulting pellets wen washed and resuspended in buffer (50 MM 
Pipes-KOH pH 6.2, 20% glycerol, 1 mM EDTA, 1 mM DTT, 0. 1 M PMSF, 1 :g/mL 
each aprotinin, leupeptin, and pep statin.) The suspension was homogenized, 
mixed with TritonX-100 to a final volume of 0.8%, and stirred for 1-2 h to solubilize 
membrane proteins. The suspension was centrifuged a final time at 100,000 x g 
for 1 h and the supernatent was ejected and saved. 

Arabidopsis cell suspension culture was also used as a tissue source. 
When Arabidopsis cell suspension culture was used, the purification procedure 
was the same except the cells were lysed an a French pressure cell at 4000 p.s.i. 

Pea carbonate-washed supernatents were pooled and separated on a 
GDP-HA agarose affinity chromatography column and GDP-binding proteins were 
eluted using excess free GDP. Protein levels were monitored by A280. The 
protein samples were desalted on a Sephadex G-25 column, concentrated, and 
further separated on a Phenomenex SEC 4000 size exclusion column. Some 
samples were further purified using a Poros QE or Resource Q anion exchange 
column and subsequently column and separated by SDS-PAGE electrophoresis. 



20 



y^J> Fucosyl Transferase Assay 



A specific 
asturtium stora 



25 



assay for fucosyl transferase was developed using tamarind or 
je xyloglucan, which naturally lack fucosyl residues, as an 
acceptor and rac iolabeled GDP-fucose as a donor [V. Farkas, G. Maclachlan, 
Arch, Biochem. Biophys. 264, 48 (1988). A. Camirand, D. Brummell, G. Maclachlan, 
Plant Physiol. 84, 753 (1 987)]. 




35 



Carbohydrate Analysis 

To confirm that the purified pea protein synthesizes an alpha-1 ,2 
fucosergalactose linkage, carbohydrate analysis was performed on the product 
resulting from in vitro fucosylatipn of tamarind xyloglucan by purified FTase . 
Carbohydrate linkage analysis of tamarind xyloglucan before (tamarind xg) and 
after (fucosylated xg) incubatior with purified pea FTase. Samples were 
incubated at room temperature or 20 minutes (for immunoprecipitation samples) 
or 30 minutes (for protein purification samples) with 25 mM Pipes-KOH pH 6.2, 



0.5 mg/mL tamarind xyloglucan 



0.05% 3 H GDP-fucose (3.7 mBq/mL, 300 



GBq/mM, NEN, Boston, MA). Most assays also contained 50 :M non-radiolabeled 
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GDP-fucose to provide a quantitative measurement of enzyme activity. Assays of 
immunoprecipitation samples also contained 5 mM MgCI 2 . Reactions were 
precipitated using 70% ethanolland 3 H incorporation was measured by scintillation 
counting. The amount of fucose incorporated into the product was used to 
calculate activity in nanokats (r Kat = nMoles substrated incorporated into product 
per second.) The resullts are s hown in Table 1 . 



Table I 



Sugar 


% 


% 


Residue 


tamarind xg 


Fucosylated xg 


4-glucose 


16.4 


17.5 


4,6-glucose 


37.0 


31.5 


t-xylose 


19.0 


13.5 


2-xylose 


15.0 


14.3 


t-galactose 


12.6 


5.5 


2-galactose 




9.0 


t-fucose 




8.7 



10 



Tamarind seed xyloglucan was fucosylated by 33 pKat size exclusion 
column-purified pea FTase (I mg/mL tamarind XG, 1.5 mM GDP-fucose, 50 mM 
Pipes-KOH pH 6.2.) XG product was precipitated with ethanol, resuspended in 

15 water, reprecipitated, and sent to the Complex Carbohydrate Research Center 
(kthens, GA) for linkage analysis. An equal amount of tamarind xyloglucan was 
also submitted for linkage analysis. Linkage analysis indicated that incubation of 
tXG with purified FTase resulted in a decrease in the mole percentage of terminal 
galactose and the appearance of 2-galactose and terminal facose, thus verifying 

20 the activity of the purified enzyme. 




25 



Peptide Sequencing 



It was po 



exclusion chroma ograpy step resulting in a total of 0.05 mg protein containing 70 



nKat XG FTase 
analysis, two pol} 



sible to purify XG FTase 1400-fold by the end of the size 



ictivity. After biochemical purification and subsequent assay 
peptides of approximately 65 kDa and 60 kDa in size were 
observed to co-pujify repeatedly with XG FTase activity (Figure 1 .) 
^\ t) Limited peptide sebuence was obtained from both proteins. Proteins in 

jT^s'ize exclusion column eluate fractions containing peak amounts of FTase activity 
30 were concentrated using £ Millipore 4 mL 10 kDa concentrator and separated by 



electrophoresis. After bri 
proteins were excised, rir 



*f staining with Coomassie and destaining the separated 
sed in 50% acetonitrile, stored at -80 Q C and sent to 
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Harvard Microchernis|y (Cambridge, MA) for tryptic peptide sequencing. Six 
peptide sequences wire obtained: VFGFLGR, YLLHPTNNVWGLVVR, 
AVLITSLSSGYFEK, YYDAYLAK, LLGGLLADGFDEK, and ESILPDVNR. 




Arabid k l opis C ST idgnfification 



Using th 
Arabidopsis ES" 



se peptides as a query in the Blastp program identified an 

', 191 A6T7, which contained four out of six peptides in a deduced 



translation of a potential ORF. The 65 kDa peptide was identified as a homolog of 
BiP, the usually ER-localized molecular chaparone. It is possible that this 
10 chaparone co-p jrified with FTase activity as an artifact and prevented the 

denaturation of the FTase during purification, though this has not been confirmed. 

Peptides from the lower molecular weight protein were not significantly 
similar to proteins of known function in databases, but did allow the identification 
of an Arabidopsis EST which, when translated, contains four out of six peptides 
1 5 with amino acid identity ranging from 63%-85% (See Table 2). 

The EST(number 191A6t7) was analyzed to determine if it was a full 
length clone. Northern blot analysis using the -900 bp-long 191A6T7 as a probe 
detected an approximately 2 kb transcript, indicating that the EST did not contain 
the full-length cDNA (RMP, data not shown.). 
20 191A6T7 was used as a probe to screen the CD4-15 portion of a size- 

fractionated Arabidopsis cDNA library at high stringency (J - J - Kieber, M. 
Rothenberg, G. Roman, K. A. Feldmann, J. R. Ecker, Cell 72, 427 (1993). 
Two cDNA clones were isolated, the longest containing a 1768 bp insert. Both 
lacked 13 nucleotides of the 3' UTR and the poly-A tail found in 191A6T7. There 
25 is an AATAAA consensus polyadenylation signal eight nucleotides from the 3' end 
of the library-derived clones. The sequence contains a 1698 nucleotide ORF that 
encodes a 63.7 kDa protein a 1698 nt open reading frame (Table 3) and 
correspond to a region of the fully sequenced Arabidopsis bacterial artificial 
chromosome (BAC) T18EI4 (Table 4). 
30 The cDNA and corresponding genomic clone have been designated 

AtFT1. Interestingly, analysis of the BAC indicates that there may be a second 
FTase approximately 600 bp downstream from AtFT1 which is -60% identical to 
AtFT1 (Table 5). Whether this second FTase is expressed, as well as splicing 
patterns and localization of the encoded protein, are matters of current 
35 investigation. It does raise the possibility that a multi-gene family of FTases may 
exist in Arabidopsis. We will determine whether members of such a family might 
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be differentially regulated by such factors as environmental stress, tissue 
localization, or developmental stage. Alternatively, there may well be FTases 
which have different acceptors, such as carbohydrate protein modifications. 




Antibody Preparation 

In order to confirm the identity of AtFT1 as encoding a xyloglucan-specific 
fucosyltransferase, we prepared polyclonal antibodies directed against AtFT1 and 
used them to immunoprecipitate proteins from carbonate-washed, solubilized 
Arabidopsis proteins. 

The portion ofjAtFTI encoding aa 73 to 566 was PCR-ampIified using 
appropriate primers and cloned into the pET28a expression vector (Novagen, 
Madison Wl.) The resulting insoluble fusion protein was purified by washing four 
times with 1% Triton X-100, 50 mM Hepes-KOH pH 7.6, 10 MM MgCI 2 and one 
time with 25 mM Hep< s-KOH pH 7.0, 8 M urea. The pellet was resuspended in 6 
M guanidine-HCI and >rotein was precipitated from the supernatent with 10% 
TCA. The protein was emulsified with Titermax adjuvent (CytRx Corporation, 
Norcross, GA) and injt cted into a rabbit. For western blotting, 40 :l of 
carbonate-washed sol ibilized protein from pea and Arabidopsis and 50 ng of 
purified antigen were separated by SDS-PAGE and electroblotted. Anti-AtFT1 
Abs (1 :5000) were used for western blotting. Goat-anti rabbit antibodies 
conjugated to horseradish peroxidase was used as a secondary antibody. Signals 
were detected by the enhanced chemiluminescence method (Pierce, Rockford, 
IL.) Membranes were |ien stained with Coomassie blue to detect protein. 




Immunoprecipitations 

For immunoprecipitations, solid NaCI was added to carbonate-washed 
solubilized Arabidopsis p otein to a final concentration of 200 mM. The 
Arabidopsis protein was f recleared by incubation with 1/10 volume of 50% slurry 
of protein A sepharose be ads (Pharmacia) in buffer A (25 mM Pipes-KOH pH 7.5, 
50 MM NaCI, 2 mM EDTA pH 8.0.) The resulting supernatents were incubated 
with 50 :l of immune or preimmune anti-AtFT1 serum for 1 h. 1/5 volume of protein 
A sepharose slurry was added to precipitate the antigen-antibody complexes and 
the samples were incubated for an additional 3 hours with rocking at 4 degrees C. 
Samples were then centrii uged, washed five times in buffer A containing 1% 
Triton X-100 and two times in buffer A without detergent. The pellets were 
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resuspended in buffer A to a final volume of 120 :l and assayed for AtFTase 
activity as describe^ above. 

The imrhunoprecipitated proteins were then assayed for XG FTase 
activity. More FTase activity was correlated with pellets derived from 
immunoprecipitaiion reactions using immune antiserum rather than preimmune 
serum, thereby irdicating that the Arabidopsis clone encodes a xyloglucan- 
specific FTase (Figure 2). 

Expression in COS Cells 

Cos-7 cellfc were grown on 100 mm plates in DMEM-10% Fetal Bovine 
Serum. Cells weie transfected with different plasmids using Lipofectamine™ 
reagent (Life Technologies) following the manufacturer's instructions using 9 :g of 
DNA and 72 :g of Lipofectamine. Cells were incubated for 24 hours in the 
medium containing DNA-Lipofectamine without Fetal Bovine Serum. The medium 
was changed to Dl /IEM-10% Fetal Bovine Serum and incubated for another 48 

re scraped off the dish in 0.25 M sucrose, 10 mM Tris-HCI pH 
S. XG-FTase activity was measured using 50 :g of protein in 
>r presence (+XG) of 100 :g tamarind xyloglucan. The 
incubation was carrsd out in a volume of 0.1 mL in the presence of 1 :M GDP-Fuc; 
(93,000 dpm), 10 m VI MnC12, 20 mM Hepes pH 7.0, 0.05% Triton X-100 at 25 Q C 
for 90 min. The reac tion was halted by adding ethanol to a final concentration of 
70%. Samples were incubated at 4 9 C and filtered through 1 .5 :m glass fiber 
filters. The filters were washed with 70% ethanol containing 1 mM EDTA. The 
filters were dried anc radioactivity determined by liquid scintillation. A control 
using pea Golgi vesicles was carried out in parallel. The results indicate that 
AtFT1 expressed in c COS cell line showed in vitro FTase activity that was 41 
times higher than CCS cells transformed with an empty vector and 1 .4 times 
higher than solubilized pea Golgi vesicles (Figure 2). 

Taken together, the in vitro translation data and the cos cell translation 
data provide strong evidence that AtFT1 is involved in xyloglucan biosynthesis. 



hours. The cells w 
7.5 and 0.4% CHA 
the absence (-XG) 



Sequence Analysis of AtFT1 

Analysis of AtFT1 indicates that, while it has some structural 
characteristics common to other fucosyltransf erases, it is quite divergent at the 
amino acid sequence level. Hydrophobicity plots predict that there may be a 



charac 
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N-terminal transmembrane signal anchor sequence (see Figure 3). In vitro 
translation in the preserce of canine pancreatic microsomes followed by 
carbonate washing of the products indicates that the AtFT1 translation product is 
a membrane protein (data not shown.) As with other glycosyltransf erases, the 
C-terminal region is predicted to be largely hydrophilic. 

AtFT1 is not significantly similar to any other FTases from other organisms, 
although multiple sequence alignments have identified three motifs which appear 
to be conserved among all alpha 1 ,2-FTases. One ([IV]G[IV]HQ][VI]R..[DN]) has 
been described previously (Breton et al., 1998). In addition, a second motif 
(D[EK]..F.[EQ].DQ) and a third hydrophobic region was conserved. 

Since these proteins have different acceptor molecules but share the same 
sugar nucleotide donor (GDP-fucose), it is possible that these regions are 
involved in GDP-fucose binding or that are necessary for assumption of 
conserved structural characteristics. Some small regions of similarity are 
observed between AtFT1 and NodZ, a fucosyltransferase in Rhizobium involved in 
the synthesis of nodulation factors. 




Other Glycosyltransferases 

The unique nature of this FTase allow its use as a tool for identifying other 
glycosyltransferases. Consideration of the number of different linkages present in 
plant cell wall polysaccharides indicates that there should be several hundred 
different glycosyltransferases involved in cell wall biosynthesis. Several other 
sequences in the Arabidopsis databases do appear to be similar to AtFT1 and 
AtFT2, and thus might represent a multi-gene family of FTases or 
glycosyltransferases. Substantially identical sequences are presented in Tables 
6-14. 

In addition to database analysis, the isolation of additional plant 
fucosyltransferases is made possible using standard molecular biology 
techniques. In particular, using all or a portion of the amino acid sequence of a 
plant fucosyltransferase of the invention, one may readily design 
fucosyltransferase oligonucleotide probes, including fucosyltransferase 
degenerate oligonucleotide probes (i.e., a mixture of all possible coding 
sequences for a given amino acid sequence). These oligonucleotides may be 
based upon the sequence of either strand of the DNA comprising the motif. 
General methods for designing and preparing suchprob^s^are provided, for 
example, in Ausubel et al., and Guide to Molecular Cloning Techniques, 1987, S. 
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L. Berger and A. R. Kimmel, eds., Academic Press, New York. These 
oligonucleotides are useful for fucosyltransferase gene isolation, either through 
their use as probes capable of hybridizing to fucosyltransferase complementary 
sequences or as primers for various polymerase chain reaction (PCR) cloning 
strategies. In one particular example, isolation of other fucosyltransferase genes 
is performed by PCR amplification techniques well known to those skilled in the 
art of molecular biology using oligonucleotide primers designed to amplify only 
sequences flanked by the oligonucleotides in genes having sequence identity to 
fucosyltransferase of the invention. The primers are optionally designed to allow 
cloning of the amplified product into a suitable vector. 

Hybridization techniques and procedures are well known to those skilled in 
the art and are described, for example, in Ausubel et al. If desired, a combination 
of different oligonucleotide probes may be used for the screening of the 
recombinant DNA library. The oligonucleotides are labelled with 32 P using 
methods known in the art, and the detectably-labelled oligonucleotides are used to 
probe filter replicas from a recombinant DNA library. Recombinant DNA libraries 
may be prepared according to methods well known in the art, for example, as 
described in Ausubel et al., supra, or may be obtained from commercial sources. 

For detection or isolation of closely related fucosyltransferases, high 
stringency conditions may be used; such conditions include hybridization at about 
42 degrees C. and about 50% formamide; a first wash at about 65 degrees C, 
about 2X SSC, and 1% SDS; followed by a second wash at about 65 degrees C. 
and about 0.1% SDS, 1X SSC. Lower stringency conditions for detecting 
fucosyltransferase genes having about 85% sequence identity to the 
fucosyltransferase gene described herein include, for example, hybridization at 
about 42 degree C in the absence of formamide; a first wash at about 42 degrees 
C, about 6X SSC, and about 1% SDS; and a second wash at about 50 degrees 
C, about 6X SSC, and about 1% SDS. 

Fucosyltransferase oligonucleotides may also be used as primers in PCR 
cloning strategies. Such PCR methods are well known in the art and described, 
for example, in PCR Technology, H. A. Erlich, ed., Stockton Press, London, 1989; 
PCR Protocols: A Guide to Methods and Applications, M. A. Innis, D. H. Gelfand, 
J. J. Sninsky, and T. J. White, eds., Academic Press, Inc., New York, 1990; and 
Ausubel et al., supra. If desired, fucosyltransferases may be isolated using the 
PCR "RACE " technique, or Rapid Amplification of cDNA Ends (see, e.g., Innis et 
al., supra). By this method, oligonucleotide primers based on a fucosyltransferase 
conserved domain are oriented in the 3' and 5' directions and are used to 
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generate overlapping PCR fragments. These overlapping 3'-and 5'-end RACE 
products are combined to produce an intact full-length cDNA. P.N.A.S. 
85:8998(1988) 

Plant Transformation 

Once identified, fucosyl transferase genes can be expressed in a variety of 
bells including/ plant cells, yeasts, fungi, bacterial cells and mammalian cells. A 
wide variety p\ plants can be transformed to express fucosyl transferase genes 
and genes /elated to fucosyl transferase in order to regulate plant carbohdrate 
10 glycosylatic 

A. Dicots 

Methods for transforming a wide variety of different dicots and obtaining 
transgenic plants are well documented in the literature (See Gasser and Fraley 
u 15 (1989) Science 244:1293; Fisk and Dandekar (1993) Scientia Horticulturae 55:5- 

j '3 36; Christou (1994) Agro Food Industry Hi Tech (March/April 1994) p. 17, and the 

Q 

I y references cited therein). 

l 4 



|,4 



B. Monocots 



I-* 

U 20 Methods for producing transgenic plants among the monocots are 

currently available. Successful transformation and plant regeneration have been 
achieved in asparagus (Asparagus officinalis: Bytebier et al. (1987) Proc. Natl. 
□ Acad. Sci. USA 84:5345); barley {Hordeum vulgare\ Wan and Lemaux (1994; 

% Plant Physiol 104:37); maize (Zea mays; Gordon-Kamm et al., (1990) Plant Cell 

U 25 2:603; Fromm et al. (1990) Bio/Technology 8:833; Koziel et al. (1993) 
Bio/Technology 11:194); oats (Avena sativa, Somers et al. (1992) Bio/Technology 
10:1589); orchardgrass (Dactylis glomerata] Horn et al. (1988; Plant Cell Rep. 
7:469); rice (Oryza sativa, including indica and japonica varieties; Toriyama et al. 
(1988) Bio/Technology 6:10; Zhang et al. (1988) Plant Cell Rep. 7:379; Luo and 
30 Wu (1988; Plant Mol. Biol. Rep. 6:165; Zhang and Wu (1988) Theor. Appl. Genet. 
76:835; Christou et al. (1991) Bio/Technology 9:957; rye (Secale cereale\ De la 
Pena et al. (1987) Nature 325:274); sorghum (Sorghum bicolon Cassas et al. 
(1993; Proc. Natl. Acad. Sci. USA 90:11212); sugar cane (Saccharum spp.\ 
Bower and Birch (1992) Plant J. 2:409); tall fescue (Festuca arundinacea\ Wang 
35 et al. (1992) Bio/Technology 10:691); turfgrass (Agrostis palustris; Zhong et al. 

(1993) Plant Cell Rep. 13:1)' wheat (Triticum aestivum; Vasil et al. (1992) 
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Bio/Technology 10:667; Troy Weeks et al. (1993) Plant Physiol. 102:1077; Becker 
et al. (1994) Plant J. 5:299). 




plant fucosyl transfe 
proteins into a plant, 
plasmid of Agrobac 



C. Expr ssion Vectors 

A variety of Expression vectors can be used to transfer the gene encoding 
ase activity as well as the desired promoters and regulatory 
Examples include but not limited to those derived from a Ti 
erium tumefaciens, as well as those disclosed by Herrera- 
Estrella, L, et al., Nature 303: 209 (1983), Bevan, M., Nucl. Acids Res. 12: 8711- 
8721 (1984), Klee, H J., Bio/Technology 3: 637-642 (1985), and EPO Publication 
120,516 (Schilperooit et al.) for dicotyledonous plants. Alternatively, non-Ti 
vectors can be use i to transfer the DNA constructs of this invention into 
monotyledonous plants and plant cells by using free DNA delivery techniques. 
Such methods may iravolve, for example, the use of liposomes, electroporation, 
microprojectile bombardment, silicon carbide wiskers, viruses and pollen. By 
using these methods! transgenic plants such as wheat, rice (Christou, P., 
Bio/Technology 9: 957t-962 (1991)) and corn (Gordon-Kamm, W., Plant Cell 2: 
603-618 (1990)) are produced. An immature embryo can also be a good target 

direct DNA delivery techniques by using the particle gun 
Physiol. 102: 1077-1084 (1993); Vasil, V., Bio/Technology 
10: 667-674 (1993); Wan, Y. and Lemeaux, P., Plant Physiol. 104: 37-48 (1994), 
and for AgrobacteriummediaXed DNA transfer (Hiei et al., Plant J. 6: 271-282 
(1994); Rashid et al., Riant Cell Rep. 15: 727-730 (1996); Dong, J., et al., Mol. 
Breeding 2: 267-276 (1996); Aldemita, R. and Hodges, T., Planta 199: 612-617 
(1996); Ishida et al., Nature Biotech. 14: 745-750 (1996)). In addition, plasmid 
pMEN020 is described in Figure 4. 



tissue for monocots foi 
(Weeks, T. et al., Plant 



D. Plant Regeneration 

After transformation of cells or protoplasts, the choice of methods for 
regenerating fertile plants is not particularly important. Suitable protocols are 
available for Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (Carrot, 
celery, parsnip), Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), 
Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, 
millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various 
other crops See protocols described in Ammirato et al. (1984) Handbook of Plant 
Cell Culture -Crop Species. Macmillan Publ. Co. Shimamoto et al. (1989) 
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Nature 338:274-276;; Fromm et al. (1990) Bio/Technology 8:833-839. ; Vasil et 
al. (1990)Bio/Technology 8:429-434. 

E. Carb hydrat s from Transg nic Plants 

5 Once transgenic plants are produced, carbohydrates can be isolated from 

the plants by procedures well known in the art. These purified carbohydrates are 
useful in agriculture as well medicine. 



10 



15 




20 



25 



F. New Complex Carbohydrates 

The enzymes involved in xyloglucan biosynthesis are reasonably stable 
and moderately abundant in plants. As such, these enzymes find use in 
synthesizing various types of complex carbohydrates under controlled conditions. 
It is also possible to make new complex carbohydrates that do not exist in 
Nature by procedures well known in the art. 

G. Carbohydrates as Herbicides 

Fucoxyloglgcan (XG) is the major hemicellulosic polysaccharide in the 
primary cell wall of fflicots. Monocots have small quantitities of XG, but it seems to 
be much less important in all monocots including grasses. XG has a backbone of 
beta- 1 ,4 linked glucqsyl reisdues, with three out of every four residues 



substitituted with xyl 
free. Approximately 



se in a regular repeat, i.e. three substituted followed by one 
Dne out of six of the xylosyl residues is further substituted 
with galactose and on the two position of galatose is an alpha linked 
fucosyl residue. Thu >, the fucose is a peripheral sugar in this polymer. 
However, the fucose las been postulated to be very important in determining 
the conformation of tt e polysaccharide, including controlling interactions 



of XG with cellulose 
function of this polysa 



30 



35 



hus, the presence of fucose may be important for the 
pcharide. 

yJ^^J^ Since XG is the major hemicellulosic polysaccharides in dicots, including 
1 many important weeds, but not very abundant in the cell walls of monocots, 

including corn, wheat! rice, barley, etc., inhibitors of XG synthesis may be valuable 
herbicides. Inhibitors include specific inhibitors of the enzyme itself and 
antisense constructs [for inhibiting expression of the protein. 

It appears that all of the enzymes that synthesize XG are part of a complex 
thereby permitting the use of XG-specific fucosyltransferase to identify other 
enzymes involved in XG synthesis. With the complete set of XG biosynthetic 
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enzymes in hand, rationale herbicide design is feasible by procedures well known 
in the art. 



:<=% 



b 
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Table 2 / 
Deduc (d Amino Acid Sequ nee from the cDNA / . 

mdqnsyrrrsspirtttggsksvnfsellqmkxlssgt 

^kltrtfttclivfsvlvafsmifhqhpsdsnrimgfaea 

rvldagvfpnvtninsdkllggllasgfdedsclsryqs 

vhyrkpspykpssyusklrnyeklhkrcgpgtesykka 

lkqldqehidgdgeckyvvwisfsglgnrilslasvflya 

lltdrvllvdrgkdmddlfcepflgmswllpldfpmtd 

qfdglnqessrcygymvknqvidtegtlshlyXhlvhd 

YGDHDKMFFCEGDQTFIGKVPWLIVKTDNYF/pSLWLIP 
GFDDELNKLFPQKATVFHHLGRYLFHPTNQVWGLVTRY 
YEAYLSHADEKIGIQVRVFDEDPGPFQHVMDQISSCTQK 
EKLLPEVDTLVERSRHVNTPKHKAVLVTSl/NAGYAENLK 
SMYWEYPTSTGEIIGVHQPSQEGYQQTEKKMHNGKALA 
E M YLLS LTD N LVTS A WST FG YV AQG LGG LK P W I L YR P E N 
RTTPDPSCGRAMSMEPCFHSPPFYDOKAKTGIDTGTLV 
PHVRHCEDISWGLKLV / 
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Table 3 

cDNAS qu nee 



ATGGATCAGAATTCGTACAGGAGAAGATCGTCTCCGAT 

CAGAACCACTACCGGCGGTTCAAAGTCCGTTAATTTCTC 

CGAACTACTTCAAATGAAGTATCTCAGCTCCGGTACGAT 

GAAGCTCACGAGAACCTTCACTACTTGCTTGATAGTCTT 

CTCTGTACTAGTAGCATTCTCAATGATCTTTCACCAACA 

CCCATCTGATTCAAATCGGATTATGGGTTTCGCCGAAGC 

TAGAGTTCTCGACGCCGGAGTTTTCCCAAATTCTGATAA 

GCTTCTCGGAGGGCTACTTGCTTCTGGTTTTGATGAAGA 

TTCTTGCCTTAGTAGGTACCAATCAGTTCATTACCGTAA 

ACCTTCACCTTACAAGCCATCTTCTTATCTCATCTCTAAG 

CTTAGAAACTACGAAAAGCTTCACAAGCGATGTGGTCC 

GGGTACTGAATCTTACAAGAAAGCTCTAAAACAACTTGA 

TCAAGAACATATTGATGGTGATGGTGAATGCAAATATGT 

TGTGTGGATTTCTTTTAGCGGCTTAGGGAACAGGATACT 

TTCTCTAG CCTCG GTTTTTCTTT ACG CG CTTTTAACG GAT 

AGAGTCTTGCTTGTTGACCGAGGGAAAGACATGGATGA 

TCTCTTTTGCGAGCCGTTTCTCGGTATGTCGTGGTTGCT 

ACCTTTAGATTTCCCTATGACTGATCAGTTTGATGGATTA 

AATCAAGAATCATCTCGTTGTTATGGATATATGGTGAAG 

AATCAGGTGATTGATACTGAGGGAACTTTGTCTCATCTT 

TATCTTCATCTTGTTCATGATTATGGAGATCATGATAAGA 

TGTTCTTCTGTGAAGGAGACCAAACATTCATCGGGAAAG 

TCCCTTGGTTGATTGTTAAAACAGACAATTACTTTGTTCC 

ATCTCTGTGGTTAATACCGGGTTTCGATGATGAACTAAA 

CAAGCTATTCCCACAGAAAGCGACTGTCTTTCATCACTT 

AGGTAGGTATCTTTTTCACCCAACTAACCAAGTATGGGG 

CTTAGTCACTAGATACTACGAAGCTTACTTATCGCATGC 

GGATGAGAAGATTGGGATTCAAGTAAGAGTTTTCGATGA 

AGACCCGGGTCCATTTCAGCATGTGATGGATCAGATTTC 

ATCTTGTACTCAAAAAGAGAAACTTCTACCTGAAGTAGA 

CACACTAGTGGAGAGATCTCGCCATGTTAATACCCCCAA 

ACACAAAGCCGTGCTTGTCACATCTTTGAACGCGGGTTA 

CGCGGAGAACTTAAAGAGTATGTATTGGGAATATCCGA 

CATCAACTGGAGAAATCATCGGTGTTCATCAGCCGAGC 

C AAG AAG GTT ATC AGC AG ACCG AA AAAAAG ATG CAT AAT 

GGCAAAGCTCTTGCGGAAATGTATCTTTTGAGTTTGACA 

GATAATCTTGTGACAAGTGCTTGGTCTACATTTGGATAT 

GTAG CTC AAG GTCTTG GAG GTTTA AAGCCTTG G ATACTC 

TATAGACCCGAAAACCGTACAACTCCCGATCCTTCGTGT 

G GTCG GG CTATGTCG ATG GAG CCTTGTTTCC ACTCG CC 

TCCATTCTATGATTGTAAAGCGAAAACGGGTATTGACAC 

GGGAACACTAGTTCCTCATGTGAGACATTGTGAGGATAT 

CAGCTGGGGACTTAAGCTAGT ATG A 



-23- 




PATENT 



Table 4 

Genomic Sequence 

BAC T18E12 Accession AC005313 nucleotides 41227-43179 (contains an 
intron). 

atggatcagaattcgtacaggagaagatcgtctccgatcagaaccactaccggcggttca 
aagtccgttaatttctccgaactacttcaaatgaagtatctcagctccggtacgatgaag 
ctcacgagaaccttcactacttgcttgatagtcttctctgtactagtagcattctcaatg 
atctttcaccaacacccatctgattcaaatcggattatgggtttcgccgaagctagagtt 
ctcgacgccggagttttcccaaatgttactaacatcagtatgtgttcttccaagtcaaag 
ttttgagctttattactttagatctcgttctttacactacgcatttgcctctgtatgtcc 
atagctcttggtcgatttcaatttgagatctatactcataaaaattgagtctttgtcagt 
cacaagactactatttttggtttgatgttgttttggtgaaaaagtgctcttttgttttgg 
tctcagcttagactgttacattcgttttttccgagttttttagattttgttctgattctg 
ttttgttttgtagattctgataagcttctcggagggctacttgcttctggttttgatgaa 
gattcttgccttagtaggtaccaatcagttcattaccgtaaaccttcaccttacaagcca 
tcttcttatctcatctctaagcttagaaactacgaaaagcttcacaagcgatgtggtccg 
ggtactgaatcttacaagaaagctctaaaacaacttgatcaagaacatattgatggtgat 
ggtgaatgcaaatatgttgtgtggatttcttttagcggcttagggaacaggatactttct 
ctagcctcggtttttctttacgcgcttttaacggatagagtcttgcttgttgaccgaggg 
aaagacatggatgatctcttttgcgagccgtttctcggtatgtcgtggttgctaccttta 
gatttccctatgactgatcagtttgatggattaaatcaagaatcatctcgttgttatgga 
tatatggtgaagaatcaggtgattgatactgagggaactttgtctcatctttatcttcat 
cttgttcatgattatggagatcatgataagatgttcttctgtgaaggagaccaaacattc 
atcgggaaagtcccttggttgattgttaaaacagacaattactttgttccatctctgtgg 
ttaataccgggtttcgatgatgaactaaacaagctattcccacagaaagcgactgtcttt 
catcacttaggtaggtatctttttcacccaactaaccaagtatggggcttagtcactaga 
tactacgaagcttacttatcgcatgcggatgagaagattgggattcaagtaagagttttc 
gatgaagacccgggtccatttcagcatgtgatggatcagatttcatcttgtactcaaaaa 
gagaaacttctacctgaagtagacacactagtggagagatctcgccatgttaataccccc 
aaacacaaagccgtgcttgtcacatctttgaacgcgggttacgcggagaacttaaagagt 
atgtattgggaatatccgacatcaactggagaaatcatcggtgttcatcagccgagccaa 
gaaggttatcagcagaccgaaaaaaagatgcataatggcaaagctcttgcggaaatgtat 
cttttgagtttgacagataatcttgtgacaagtgcttggtctacatttggatatgtagct 
caaggtcttggaggtttaaagccttggatactctatagacccgaaaaccgtacaactccc 
gatccttcgtgtggtcgggctatgtcgatggagccttgtttccactcgcctccattctat 
gattgtaaagcgaaaacgggtattgacacgggaacactagttcctcatgtgagacattgt 
gaggatatcagctggggacttaagctagtatga 
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Table 5 

Adjacent h m logous g ne on BAC 

BAC T18E12 Acc ssion AC005313 nude tides 43562-45245 

atgagaatcacagagatcttagctttgttcatggttttagtccctgtctcgctagtaatc 

gtagccatgtttggatatgatcaaggaaatggctttgtacaagcatctagattcataaca 

atggaaccaaatgtgacatcctcatcagatgattcatcactagtgcagagagatcaagaa 

caaaaaggtaaacttactttcttctttttgttttgaaatgtttctaaatttttctttgaa 

tgtttcatcagattctgtagatatgtctctgcttggagggctacttgtatctggtttcaa 

gaaagagtcttgcttgagtagataccaatcttacctctaccgtaaagcttcaccgtataa 

accttcgttgcatctactttcgaagcttagagcttacgaagagcttcataaaagatgtgg 

accgggaacaagacagtataccaatgcagaaagattgcttaaacagaaacaaacaggtga 

gatggaatcacaaggatgcaagtatgttgtttggatgtcgtttagcggattaggaaacag 

gattatcagtattgcttctgtgtttctgtatgcaatgttgacagatagagtcttgcttgt 

tgaaggaggggaacagttcgcggatttattctgcgaaccgttcctcgataccacttggtt 

actaccgaaagatttcaccttagctagtcagttcagtggctttggtcaaaactcagctca 

ctgccatggagatatgctgaagaggaaactgattaatgaatcctctgtttcgtctctgtc 

tcatctctatcttcatctagctcatgactacaatgagcacgacaaaatgttcttctgtga 

agaagatcaaaatctcttaaagaatgttccttggttgatcatgaggacaaacaacttctt 

tgcaccgtctcttttcttgatttcttctttcgaagaagagctcggtatgatgtttcccga 

gaaaggaacggtttttcaccatttaggtcgttaccttttccatccttcaaatcaagtctg 

gggactaatcacaagatactatcaagcttacttagccaaagctgatgaaaggattggtct 

tcaaataagagtctttgatgagaaatccggcgtatctcctcgagtcacaaagcaaatcat 

ttcgtgtgttcaaaacgagaatctgttaccgagactaagcaaaggtgaagaacaatacaa 

gcagccatcagaagaagagttgaaactcaaatctgtcttggtcacctctttaacaacagg 

atactttgagatcttgaaaacaatgtattgggaaaatccaactgtaacaagagatgtgat 

tggaatacatcagccaagtcatgaaggacatcaacaaacagagaagctaatgcataacag 

gaaagcttgggcagagatgtacttactcagcttaacggataagttggttattagtgcttg 

gtctacatttggttatgtagctcaaggacttggaggattaagagcttggattctgtataa 

acaagagaatcaaaccaacccaaatccaccttgcggtagagctatgtcaccagatccttg 

tttccatgctcctccttactatgattgcaaagcaaagaaaggaactgacactggtaatgt 

tgtcccgcatgttagacattgtgaagatattagctggggacttaagcttgttgacaactt 

ttag 

Protein translation: 

MRITEILALFMVLVPVSLVIVAMFGYDQGNGFVQASRFITMEPN 

VTSSSDDSSLVQRDQEQKDSVDMSLLGGLLVSGFKKESCLSRYQSYLYRKASP 

YKPSLLLSKLRAYEELHKRCGPGTRQYTNAERLLKQKQTGEMESQGCKYVVWM 

SFSGLGNRIISIASVFLYAMLTDRVLLVEGGEQFADLFCEPFLDTTWLLPKDFTLA 

SQFSGFGQNSAHCHGDMLKRKLINESSVSSLSHLYLHLAHDYNEHDKMFFCEE 

DQNLLKNVPWLIMRTNNFFAPSLFLISSFEEELGMMFPEKGTVFHHLGRYLFHPS 

NQVWGLITRYYQAYLAKADERIGLQIRVFDEKSGVSPRVTKQIISCVQNENLLPRL 

SKGEEQYKQPSEEELKLKSVLVTSLTTGYFEILKTMYWENPTVTRDVIGIHQPSH 

EGHQQTEKLMHNRKAWAEMYLLSLTDKLVISAWSTFGYVAQGLGGLRAWILYK 

QENQTNPNPPCGRAMSPDPCFHAPPYYDCKAKKGTDTGNVVPHVRHCEDISW 

GLKLVDNF 
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Tabl 6 



Arabidopsis thaliana cDNA clone 202C15T7, mRNA s quence [Arabidopsis 
thaliana] 

DNA Sequence 

TGTTCCATCNTTATGGTTTAATCCAACTNTCCAAACCGAACTAACGAAGCTGT 

TTCCGCANAAAGAAACCGTGTTTCACCACTTGGGTCGGNATCTTTTTNACCCT 

AAAAATCAAGTTTGGGATATCGTCACNAAGTACTACCATGNTCACTTATCCAA 

AGCAGATGNGAGACTCGGGATTCAAATTCGGGTTTTTNGCGATCAAGGTGGA 

TACNACCAACACGTCATGGACCAGGTCATATCCTGCACACA 



Translation of 202C15T7 in correct ORF. 

VPSLWFNPTXQTELTKLFPXKETVFHHLGRXLFXPKNQVWDIVTKYYHXHLSKA 
DXRLGI QIRVFXDQGGYXQHVMDQVISCT 
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Tabl 7 

Arabidopsis thaliana cDNA clone 170K19T7, mRNA sequence [Arabidopsis 
thaliana] 

5 

TGGNATTACAGATTACAAAGATACGAGGNTCTTCATAGACGTTGTGGACCATT 
CACTAGATCCTATAACTTAACACTTGACAAACTCAAGTCGGGAGATCGGTCTG 
ACGGTGAAGTTTCTGGTTGTAGATATGTAATATGGTTGANTTCCAATGGTGAT 
CTTGGGAATAGGATGCTGAGTCTAGCTTCANCTTTNCTTTATGCTCTCTTAAC 

1 0 AAATAGGTTTTNACTTGTCGAACTAGGAGTTGACATGGCTGATCTTTTCTNCG 
AGCCATTTCCAAACACTACTTGGTTTCTTCCCCCAGAGTTTCCGCTCAACAGC 
CACTTCAACGAGCAAGTCTCTTTCTAACGGAAATTNTTGGCAACCCCGATGG 
GTTCATAATCGNNCATGTAGTTCCGTNATTCCCAGTGNCCAACAAAAAGCTTT 
TTNTTTTTGNNAGGNTAGCCAAGTTTTTTTNGGGGAAACCCCCTGGTTGTCTT 

1 5 AAAANCGGGTAGNT TTTTTTTCCCAACTTTTTTTNA 
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Table 8 

T31J1TRTAMU Arabidopsis thaliana gen micclon T31J1, genomic survey 
sequ nee 

C A AG CTT ACA AG AA AG CAACG G AG ATTCTTGGTC ATG ATG ATG AG AATC ATTC 

AACCAAATCTGTTGGTGAATGCAGATACATTGTGTGGATTGCTGTTTATGGGC 

TAGGAAACAGAATACTTACTCTTGCTTCTCTGTTTCTCTATGCTCTCTTGACTG 

ACAGAATCATGCTTGTTGACCAACGTACGGACATAAGTGACCTCTTCTGTGA 

GCCTTTTCCAGGTACTTCCTGGCTACTCCCTCTGGATTTTCCACTAACAGATC 

AATTAGATAGCTTCAACAAGGAATCTCCGCGCTGTTACGGAACAATGTTGAA 

GAATCATGCCATTAACTCAACTACAACAGAAAGCATCATCCCCTCGTACCTCT 

GTCTTTATCTTATTCACGATTACGACGATTATGATAAGATGTTCTTCTGTGAAA 

GTGACCAAATTCTCATCAGGCAAGTCCCTTGGTTGGTCTTCAACTCGAATCTT 

TACTTTATCCCATCTCTATGGTTGATCCCTTCTTTTCAGTCAGAATTAAGCAAG 

CTATTCCCACAGAAAGAAACCGTCTTTCACCATTTGGCTCGCTATCTTTTTCA 

CCCGACTAACCAAGTTTGGGGCATGATCACAAGATCCTATAATGGGTATTTAT 

CAAGAGCTGATGAGAGACTTGGGATTCAAGT 



1 
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Tabl 9 



F16M20TR IGF Arabidopsis thaliana g nomic cl ne F16M20, g nomic survey 
s qu nc 

TTCTCCTTTTGACCTTTTTTTTTGTTATATGTTCAGACGAATCCGAAACACCGG 

GGCGGGATAGACTAATAGGAGGGCTTTTAACCGCAGATTTCGATGAAGGTTC 

TTGCTTGAGTAGGTATCATAAAACTTTCTTGTATCGCAAGCCTTCACCATACA 

AGCCGTCTGAATATCTTGTCTCGAAGCTTAGAAGCTATGAGATGCTTCACAAA 

CGTTGCGGTCCAGGGACAAAAGCTTACAAGGAAGCAACAAAGCATCTTAGTC 

ATGATGAGAATTATAATGCAAGCAAATCAGATGGTGAATGCCGATACGTTGTG 

TGGCTCGCTGATTACGGGCTTGGAAACCGACTACTCACTCTTGCTTCTGTGT 

TCCTCTACGCTCTCTTGACTGATAGAATCATTCTTGTTGACAACCGCAAGGAT 

ATTGGTGATCTCTTATGCGAGCCATTTCCAGGTACTTCATGGTTGCTTCCTCT 

CGACTTTCCATTGATGAAATATGCTGATGGATACCACAAGGGATACTCTCGTT 

GTTACGGAACAATGTTGGAAAATCATTCCATCAACTCGACTTCATTCCCGCCA 

CATCTATATAGGCATAACCTTCATGATTCAAGGGATAGTGATAAGATGTTCTT 

CTGCCAAAAAGATCAAAGTTTGATTGACAAAGTCCCT 
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Tabl 10 

F16A14-T7 IGF Arabidopsis thaliana genomic clone F16A14, genomic surv y 
s qu nc 

GGGGGGGATGGTTACTGACTCCTATATGCCGAATCTTTGACATCTCTGTTTC 

AATGGCCACAATCCTATTGAATCAGCTATATTAAAGAAAATTATAACTCATCAA 

ATAGCTTAAGACCATCGTTCCCACGATCCTCACAATGCCTTNCNAGAGGAAC 

TACCTTCCCGGAGTTAGTTCCCCATTCGGGTTCACATCCATGAGACGGAAGA 

GTAAGGTGACNATGGTCCATCGACGTGGATTGAATACNCTGTGGATCAGGAG 

CTGTACGACCTGCTGGCTGATAAAGTAACCATGGCTTTAATCCTCCAAGAATA 

TGAGCAACATATCCNAATGTAGACCTTGCACTTGTGACTATTTTATCAGTTAG 

ACTTAGAAGATACNTCTCGGCGAGCGCCTTTTGGTCGTGTANCTTCTTGTCTT 

NTGTTGAACCCTTTCTCCACTTGGCTGATNAACTTCAATGATCTCCCCTGCTG 

AACTCGGTCGTTCCCAATACATGTTCTNTAAGGTNTCAGAGTACTCTGGATAC 

NAAGATGTGACNAGAACAGCTNTAAGTGTCTGGCTTCTTGAATATATGACTTT 

TGGCTCTTCTTGTGCACCTTGTTCAGGCAAAAGGTCTCTCTTCCTGTCCAACT 

TACAACTTGATCCNTTNCCTGTTAANATTTCCCCCTCGAATGCTGAACTACCC 

CTTCTCTAATAACCNNCCTCTCCTCCGCTCCTGAATAACTTCGGCTTGCTAGA 

ATTCTCTCATTCACCTCCCCACTTGAACCC CCCCGCGGTACAAACC 
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Table 11 

T26M12-T7 TAMU Arabid psis thaliana g nomic clone T26M 12, genomic 
survey sequ nee [Arabidopsis thaliana] 

ATTCGTGATGAGTACTATGCAAGCGAATCAAATGGTGACTGCAGATACATTGT 

ATGGCTAGCTAGGGACGGGCTTGGAAACAGATTAATTACTCTTGCTTCCGTG 

TTTCTCTACGCTATCTTGACTGAGAGAATCATTCTTGTTGACAACCGCAAGGA 

TGTTAGTGATCTCTTATGTGAGCCATTTCCAGGTACTTCATGGTTGCTTCCGC 

TTGACTTTCCAATGCTGAATTATACTTATGCTTATGGCTACAATAAGGAATACC 

TCGTTGTTACNGTACAATGTTGGAAAATCATGCCATCAACTCGACTTCAATTC 

CGCCACATCTATATCTCCATAACATCCATGAATCTAGGGATANTGATAAGCTG 

TTCTTCTGCCAAAANGGATCAAAGTTTTTTATCGACANATTTCCATGGGTAAAT 

TAATTCANAACCAATGCCTTACTTTGGTTCCCAATCTTTATGGGCTGAAATCC 

CANCTTTTCCAN ACCAAAAACTAAGTTTAAGCTTATCCCCGGCAGAAAAGG 
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Tabl 12 

ATTS3691 Gif-S dA Arabid psis thaliana cDNA cl ne YAY241 , mRNA 
sequence [Arabidopsis thaliana] 

A ATG GTG ATCTTGGG A AT AG G ATG CTG AGTCT AG CTTC AG CTTTTCTTT ATGC 
TCTCTTAACAAATAGGTTTTAACTTGTCGAACTAGGAGTTGACATGGCTGACC 
TTTTCTGCAAGCCATTTCCAAACACTACTTGGTTTCTCCCCCCAGAGTTTCCG 
CTCAACAGCCACTTCAACGAGCAGTCTCTTCTACGCAATTCTGGCAACCCGA 
TGGTTGCATATCGACATGTAGTTCGTGAATTCCAGTGACCAACAAAAGCTTTT 
CTTTTGTGAGGATAGTCAAGTTTTGT 
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Tabl 13 T22N7TRB TAMU Arabidopsis thaliana g nomiccl n T22N7, 
gen mic survey sequ nce[Arabid psis thaliana] 

CAAGCTTCGAGACAAGATATTCAGACGGCTTGTATGGTGAAGGCTTGCGATA 
CAAGAAAGTTTTATGATACCTACTCAAGCAAGAACCTTCATCGAAATCTGCGG 
TTAAAAGCCCTCCTATTAGTCTATCCCGCCCCGGTGTTTCGGATTCGTCTGAA 
C ATATAAC A A AA A AA AAG GTC AAAAG G AG AATTCTTTG AG CTAACA ATG 
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Tabl 14 

31198 Lambda-PRL2 Arabidopsis thaliana cDNA clone 170K19XP 3', mRNA 
s quenc [Arabidopsis thaliana] 

AAANNCCTTAANCAANTTTTACCGAANTCAAGGCGTTTACCCACTTCTCNCCN 

GGTTTTAAGGTTCAGGGCNNTTTTTGGNAACCCNACAGTGATGGNGAGTTAT 

CCGCGTTCACAANCCGACTACAAGGCTTCCAAAAACCCCCGNGGAACNTGG 

AANTTAAGAGANCATGGCTGAGATATACCTTCTGAGTTGTTCTGATGCNCTG 

GTGGTCACAGGTTTATGGTCCTCACTCGTGGAGGTTGCCTCATGGCCTTGGA 

GGGTTGAAGCCATGNGTGTTGAACAAAGCTGAGAATGGGACTGCCCATGAG 

CCTTACTGTGTGAAAGCAAGATCAATAGAGCCCTGTTCCCAAGCGACATTGT 

TCCATGGCTGTAAAGATTGAAACATGAATAGAGTCTCGAGGGCTTTTTTTGCC 

TTTAATAGATGTTGTACGGTCAAGAATTTCAGAGTTGCCCAATAGACACGTAA 

GGAATATTAGGATTAACTATGTATCAGTTCATGACTTGATCGAGTTCTATATTC 

TTTTCAAT 
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